Determination of the Optimum Degree of Redundancy for Fault-prone Many-Core Systems
نویسنده
چکیده
The increasing transistor integration capacity will entail hundreds of processors on a single chip. Further, this will lead to an inherent susceptibility to errors of these systems. To obtain reliable systems again, various redundancy techniques can be applied. Of course, the usage of those techniques involves a significant overhead. Therefore, the identification of the optimal degree of redundancy is an important objective. In this paper we focus on core-level redundancy and checkpointing rollback-recovery. A model to determine the optimal degree of spatial and temporal redundancy regarding the minimal expected execution time will be introduced. Further, we will show that in several cases, the minimal expected execution time is achieved just by a simultaneous combination of both techniques, spatial redundancy and temporal redundancy.
منابع مشابه
An approach to fault detection and correction in design of systems using of Turbo codes
We present an approach to design of fault tolerant computing systems. In this paper, a technique is employed that enable the combination of several codes, in order to obtain flexibility in the design of error correcting codes. Code combining techniques are very effective, which one of these codes are turbo codes. The Algorithm-based fault tolerance techniques that to detect errors rely on the c...
متن کاملDetermination of optimum of production rate of network failure prone manufacturing systems with perishable items using discrete event simulation and Taguchi design of experiment
This paper, considers Network Failure Manufacturing System (NFPMS) and production control policy of unreliable multi-machines, multi-products with perishable items. The production control policy is based on the Hedging Point Policy (HPP). The important point in the simulation of this system is assumed that the customers who receive perishable item are placed in priority queue of the customers w...
متن کاملDETERMINATION OF OPTIMUM LOCATION FOR FLEXIBLE OUTRIGGER SYSTEMS IN NON-UNIFROM TALL BUILDINGS USING ENERGY METHOD
In this paper, based on maximizing the outrigger-belt truss system’s strain energy, a methodology for determining the optimum location of a flexible outrigger system is presented. Tall building structures with combined systems of framed tube, shear core, belt truss and outrigger system are modeled using continuum approach. In this approach, the framed tube sys...
متن کاملReliability Optimization for Complicated Systems with a Choice of Redundancy Strategies (TECHNICAL NOTE)
Redundancy allocation is one of the common techniques to increase the reliability of the bridge systems. Many studies on the general redundancy allocation problems assume that the redundancy strategy for each subsystem is predetermined and fixed. In general, active redundancy has received more attention in the past. However, in real world, a particular system design contains both active and col...
متن کاملFault Tolerant Reversible QCA Design using TMR and Fault Detecting by a Comparator Circuit
Quantum-dot Cellular Automata (QCA) is an emerging and promising technology that provides significant improvements over CMOS. Recently QCA has been advocated as an applicant for implementing reversible circuits. However QCA, like other Nanotechnologies, suffers from a high fault rate. The main purpose of this paper is to develop a fault tolerant model of QCA circuits by redundancy in hardware a...
متن کامل